Wald for non-stopping times: The rewards of impatient prophets
نویسندگان
چکیده
Let X1, X2, . . . be independent identically distributed nonnegative random variables. Wald’s identity states that the random sum ST := X1 + · · · + XT has expectation ET ·EX1 provided T is a stopping time. We prove here that for any 1 < α ≤ 2, if T is an arbitrary nonnegative random variable, then ST has finite expectation provided that X1 has finite α-moment and T has finite 1/(α− 1)-moment. We also prove a variant in which T is assumed to have a finite exponential moment. These moment conditions are sharp in the sense that for any i.i.d. sequence Xi violating them, there is a T satisfying the given condition for which ST (and, in fact, XT ) has infinite expectation. An interpretation is given in terms of a prophet being more rewarded than a gambler when a certain impatience restriction is imposed.
منابع مشابه
An analytical study of purification in Quran relying on Al-Mizan commentary
The word purification and its derivatives have been mentioned 25 times in Quran. Purifying one`s soul is of essential effect in human salvation, Quran, therefore, has considerably attended to it. The reason why lies in the fact that without purifying human soul from deviations in religious belief, morality and behavior one never would get happiness. Having presented semantic of the word and def...
متن کاملANALYSIS OF A DISCRETE-TIME IMPATIENT CUSTOMER QUEUE WITH BERNOULLI-SCHEDULE VACATION INTERRUPTION
This paper investigates a discrete-time impatient customer queue with Bernoulli-schedule vacation interruption. The vacation times and the service times during regular busy period and during working vacation period are assumed to follow geometric distribution. We obtain the steady-state probabilities at arbitrary and outside observer's observation epochs using recursive technique. Cost analysi...
متن کاملOptimal Stopping for Non-linear Expectations
We develop a theory for solving continuous time optimal stopping problems for non-linear expectations. Our motivation is to consider problems in which the stopper uses risk measures to evaluate future rewards.
متن کاملOptimal stopping for non-linear expectations—Part I
We develop a theory for solving continuous time optimal stopping problems for non-linear expectations. Our motivation is to consider problems in which the stopper uses risk measures to evaluate future rewards. Our development is presented in two parts. In the first part, we will develop the stochastic analysis tools that will be essential in solving the optimal stopping problems, which will be ...
متن کاملMaximum reward reinforcement learning: A non-cumulative reward criterion
Existing reinforcement learning paradigms proposed in the literature are guided by two performance criteria; namely: the expected cumulativereward, and the average reward criteria. Both of these criteria assume an inherently present cumulative or additivity of the rewards. However, such inherent cumulative of the rewards is not a definite necessity in some contexts. Two possible scenarios are p...
متن کامل